Gene expression data mining guided by genomic background knowledge
نویسندگان
چکیده
Microarray data represents valuable information resources, nevertheless the knowledge is hidden inside the data and it is not easy to mine. Background knowledge is also stored in various formats and it is challenging to automatically infer the biological meaning from existing repositories. This paper deals with a new gene-expression knowledgefusion system that combines molecular biology data from various sources — the experiment in hand, gene expression data from similar experiments stored in array expression databases, additional knowledge on the most significant genes and their products from specialised services (e.g., pathway databases), and automatically derived results provided by relevant scientific literature. The design of the proposed system is rather complex. We take advantage of recent semantic web technologies to integrate the various modules of the system. Some of the components described in the paper have already taken part in the end-user applications, others still wait for their implementation in the form of software tools.
منابع مشابه
Data Mining for Identification of Forkhead Box O (FOXO3a) in Different Organisms Using Nucleotide and Tandem Repeat Sequences
Background: Deregulation of FOXO3a gene which belongs to Forkhead box O (FOXO) transcription factors, can cause cancer (e.g. breast cancer). FOXO factors have important role in ubiquitination, acetylation, de-acetylation, protein-protein interactions and phosphorylation. Understanding the regulation and mechanisms of FOXO3a can lead to cancer treatment. The aim of this study recent association...
متن کاملClinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
In recent years, a growing number of researchers began to focus on how to establish associations between clinical and genomic data. However, up to now, there is lack of research mining clinic-genomic associations by comprehensively analysing available gene expression data for a single disease. Colorectal cancer is one of the malignant tumours. A number of genetic syndromes have been proven to b...
متن کاملA hypergraph-based learning algorithm for classifying gene expression and arrayCGH data with prior knowledge
MOTIVATION Incorporating biological prior knowledge into predictive models is a challenging data integration problem in analyzing high-dimensional genomic data. We introduce a hypergraph-based semi-supervised learning algorithm called HyperPrior to classify gene expression and array-based comparative genomic hybridization (arrayCGH) data using biological knowledge as constraints on graph-based ...
متن کاملADAM Gene Expression in The Adult CNS and Genetic Aberrations in Cancer Cells
ADAM metalloprotease-disintegrins share a common modular structure of functional domains for proteolytic, cell adhesion, and signaling interactions. The metalloprotease domain of oughly half of the known ADAMs contain an intact consensus metzincin catalytic site, and they are thus thought to function as active metalloproteases. The types of interactions mediated by ADAMs are expressly conspicu...
متن کاملO-3: Drug Repositioning by Merging Gene Expression Data Analysis and Cheminformatics Target Prediction Approaches
The transcriptional responses of drug treatments combined with a protein target prediction algorithm was utilised to associate compounds to biological genomic space. This enabled us to predict efficacy of compounds in cMap and LINCS against 181 databases of diseases extracted from GEO. 18/30 of top drugs predicted for leukemia (e.g. Leflunomide and Etoposide) and breast cancer (e.g. Tamoxifen a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008